-
Notifications
You must be signed in to change notification settings - Fork 119
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[develop] Change conda env for AQM on Hera to shared one and fix input data issue on WCOSS2 #844
[develop] Change conda env for AQM on Hera to shared one and fix input data issue on WCOSS2 #844
Conversation
@chan-hoo I'm currently working on running the I do have a question about the closing of issue #684. Should we keep this issue open until all machines have AQM-specific tasks like WCOSS2, Cheyenne, and Hera and all use |
@MichaelLueken, I agree with you. Since AQM is not available on other machines, you can close the issue once this PR is merged. |
@chan-hoo Since AQM can only run on Cheyenne, Hera, and WCOSS2, I will go ahead and keep the issue set as closable when this PR is merged. |
@chan-hoo , @MichaelLueken - is it only the issue with staged data for running the AQM on other machines?.. I'm still working on moving the AQM data to the rest of the systems, so it should not be an issue when finished. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@chan-hoo I was able to successfully run the aqm_grid_AQM_NA13km_suite_GFS_v16
WE2E test on Hera. I do have one concern before moving forward with approving this PR. I noted that Orion also has AQM tasks, but it is still using miniconda_online-cmaq
rather than miniconda_regional_workflow_cmaq
.
Please update the Orion tasks to use the new miniconda_regional_workflow_cmaq
, then I can approve this PR.
@natalie-perlin, you don't have to hurry. Besides, AQM is not available on other machines yet. |
@MichaelLueken, updated! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@chan-hoo Thank you very much for making these changes on Orion! Approving now.
@chan-hoo - what is needed to make AQM available on all the machines? |
@natalie-perlin, I think so. The AQM input data are necessary and their paths should be defined in ush/machine/[machine].yaml. In addition, the module files in modulefiles/tasks/[machine]/[task] should be updated. |
@chan-hoo - then it would be quite simple to update to the rest of the platforms! I already transferred data to most of the platforms, in ./UFS-SRW_data/aqm_data/ and Note about Orion: |
@natalie-perlin, the issue is that the input data of NEXUS and point source is huge (NEXUS > 100TB, point source > 100GB). You can check their size on Hera (/scratch2/NCEPDEV/naqfc/RRFS_CMAQ/emissions/nexus). I don't think AQM will need the entire directory of NEXUS, but you'll need help from ARL. |
@chan-hoo - |
NOAA's Air Resources Laboratory. They maintain the nexus and point source directories. |
@natalie-perlin @chan-hoo We have actually been trying to get the system up and running and most of the input data is probably already transferred for the GEFS/ UFS-Aerosol work in the fix directory. But we also maintained a backup on the each system for development work. We have the emissions here on Orion: |
@bbakernoaa - thank you for the info! |
@bbakernoaa @chan-hoo Is it correct that we do have all the data on Hera and Orion for running AQM? Cheyenne and other systems have [partial] data from Chan-Hoo in EPIC-maintained space in .../UFS_SRW/data/..., but we may need other (nexus) as well. |
The coverage WE2E tests were manually ran on Orion and all successfully passed:
Awaiting successful completion of the rest of the automated tests before merging this work to develop. |
The automated Jenkins tests have successfully passed on Cheyenne, Hera, and Jet. With the success of the manual run on Orion, all tests have successfully passed. Given this and the two approvals for this work, I will now move forward with merging this work to develop. |
DESCRIPTION OF CHANGES:
TEST_AQM_INPUT_BASEDIR
) to the shared directory for WE2E.TESTS CONDUCTED:
ISSUE:
CHECKLIST
CONTRIBUTORS:
@natalie-perlin